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Abstract 

This paper addresses finite sample stability properties of sequential Monte Carlo methods for approx- 
imating sequences of probability distributions. The results presented herein are applicable in the scenario 
where the start and end distributions in the sequence are fixed and the number of intermediate steps is 
a parameter of the algorithm. Under assumptions which hold on non-compact spaces, it is shown that 
the effect of the initial distribution decays exponentially fast in the number of intermediate steps and 
the corresponding stochastic error is stable in hp norm. 
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1 Introduction 

Sequential Monte Carlo (SMC) methods are a class of stochastic algorithms for approximating sequences of 
probability measures using a population of N particles. They have been adopted in a variety of application 
domains, including rare event analysis [8j, statistical physics [22, optimal filtering |18| 12] and computational 
statistics [3, llj. Various theoretical properties of SMC methods have been studied, and in various contexts, 
see amongst others [U [531 [T31 |5S] , and the seminal work of Del Moral [6 . Existing stability results for SMC 
methods rely on very strong assumptions which typically do not hold on non-compact spaces. This article is 
concerned with establishing stability properties for a class of SMC algorithms primarily motivated by those 
of Del Moral et al. [11 , under assumptions which do hold on non-compact spaces. The following example 
indicates a scenario of interest. 
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1.1 A motivating example 

Let TT be a distribution on some space X, admitting a strictly positive and bounded density with respect to 
some dominating measure. For ease of presentation, let tt also denote this density and let vf denote the corre- 
sponding unnormalised density, i.e. for some Z > 0, 7r(x) ~ Tt{x)/Z. For some 7 £ (0, 1), let 7 : [0, 1] — > [7, 1] 
be a non-decreasing Lipschitz function with 7(0) = 7 and 7(1) = 1. For n e N let {Gn.k', < fc < n} be the 
collection of potential functions on X defined by Gn,k{x) = 7f''(('=+i)/")(a;)/7f''('=/")(a;). Let {MnX, I < k < n} 
be a collection of ergodic Markov kernels also on X, where Mn^k admits as its invariant distribution the prob- 
ability with density proportional to tt'^*^'^/"-' (x) (denoted by 7r^(fe/ri))- For some initial distribution /i, consider 
the sequence of probability distributions {r/n,k', 1 < k < n} defined, for a test function /, by 



Vn,k{f) ■■= r~~, 1 ' (1) 







=0 Gn,k{Xn,k)f{X 


ri,n ) 




Y\1=0 Gn,k{Xn,k) 





where denotes expectation with respect to the law of the inhomogeneous Markov chain {Xn,k'-, < k < n}, 
such that Xq ^ /i and Xn^k+i\^n,k = Xn.k ^ ^n.k+i {xn,ki')- The special interest in (jlj is that when 
/i = TT^, then ?7„ „ = tt, for any n > 1 . Furthermore, in applications Z is unknown, the distributions of 
the form ([ij cannot be computed exactly, and one aims to obtain an approximation of tt by fixing n then 
approximating each of the {r^n.k] < fc < n} in turn, as follows. Let G N and let {Cn,fc; < fc < n} be an 
inhomogeneous Markov chain, with each C,n,k = fc; 1 < « < an A^-tuplet and with each j. valued in 
X, with {Ci,o; 1 < * < ^} independent and of common distribution ^. Given C„_fc, the \Cn,k+i 
are independent, with ^.j^^ drawn from 

SjLl Gn,k iin^k) Mn,k+1 {^n^k^ 

= l Gn,k iiii.k) 

The particle approximation measure at time step k is 77^ := ;^ j.' takes rjnnif) '■— 

jj X^iLi /(Cn n) as an approximation of 7r(/). The expectation terms in Q are in the shape of Feynman-Kac 
formulae, and adopting the terminology of Del Moral [6_ throughout the following, a general collection of 
Maxkov kernels, potential functions, initial distribution and associated {rin,k} are referred to as constituting 
a Feynman-Kac (FK) model. 

The FK model described above has a notable structural characteristic: due to fact that 7 and tt are fixed 
and 7(-) is continuous, the potential functions each become fiat as n — > cx3 (note that this is not an essential 
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feature of the general FK models considered by Del Moral [5^ , nor is it the regime usually considered in the 
filtering scenario). One might then conjecture due to this "flattening" property that the sequence of measures 
{'7Ti,fc; < fc < n} inherit ergodicity properties from the Markov kernels when fi ^ n^y, and rinn{f) — 7r(/) goes 
to zero at some rate as n — > oo. Perhaps more adventurously, one might further conjecture that this stability 
property is inherited by the corresponding particle system so that [|^^n(/) ~ ""(Z)!^] controlled 
uniformly in n, and diminishes at the usual \/N rate, where denotes expectation w.r.t. the law of the 
particle process initialised using /i. 

The results presented in this paper allow it to be shown that this is indeed the case under assumptions 
which are realistic in the context of applications such as on non-compact spaces. The contributions are 
to establish deterministic stability results for a broad class of FK models, of which the example above is one 
instance, and to provide Lp bounds for the corresponding particle errors. 

1.2 Summary of results 

The present work is built upon generic assumptions about the FK model structure, which can be loosely 
summarized as follows: 

• the Markov kernels {Mn^k} are geometrically ergodic, with common Foster-Lyapunov drift function V 
and associated constants, and with a common minorization condition 

• the potential functions {Gn,k} are uniformly bounded above, and are of the form Gn,k{x) oc exp [— ^C/n,fc(a;)] , 
for Un,k positive and bounded uniformly over n and k in y-norm . 

• / is bounded in 1^-norm 

Stability properties of general FK semigroups are established in Theorem [l] (section [S]) . Note here that, in 

contrast to the above example, no speciflc form is assumed for {Un,k} or their relations to the invariant 

distributions of {M„_fe}. Theorem [2] (section |4]) provides Lp error bounds for the corresponding particle 

systems, under additional assumptions on the drift properties of the particle process. 

For the reader's convenience Theorem [s] is now summarized (for the precise statement see section |5|, 

which is an application to the example sketched above. Let X — . Then when tt has a sub-exponential 

density w.r.t. Lebesgue measure with asymptotically regular contours, and when each Af„jt is a random 

walk Metropolis kernel of invariant distribution 7r^(fe/„), under a suitable trade-off between p > 1, a G [0, 1) 

|/(a;)| 

and 7, there exists p < 1 and constants Ci and C2 such that for any / with ||/||yc, sup^. < +00, 

— V (x) 
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any > 1 and n > 1, 

K(/)--(/)r]'^' < 11/11^. (^(\^)+p"C2l[M^7rj), (2) 



where := Vnm the particle approximation as in sectionjlj The first term on the r.h.s. of (|2j) corresponds 
to the stochastic errors from the population of size N interacting and mutating over n times steps. The 
second term on the r.h.s. is a deterministic bias which arises if the initial distribution is mis-specified, and 
this bias is decays exponentially quickly in n. 

1.3 Existing work 

Despite the fact that the focus here is on models with the "flattening" property, existing SMC stability results 
cannot be transferred directly to the present scenario under realistic assumptions. They are reviewed below 
for completeness. 

Del Moral [6, Theorem 7.4.4] proved time-uniform Lp error bounds, under assumptions which in the 
present scenario would take the form: 

• there exist ejv/ > 0, m > and > such for all n, k and a;, y G X 

Mn,k ■ ■ ■ Mn,k+m{x, ■) > ej\/Af„,fe . . . Mn,k+miy, ') (3) 

Gn,k{x) > eGGn,fc(2/), (4) 



• / is bounded. 

Note that these results hold for fully general FK models, not necessarily having the "flattening" property (see 
also glOl [71 E51 [51 [T^ for various results under the same type of assumption as one or both of (|3|-(|4])). 
However, these assumptions are very strong. Equation ([S]) is stronger than uniform ergodicity of the m-step 
kernels, and typically is not satisfled for the kernels of interest in [llj, such as Metropolis-Hastings kernels 
on M'*. For a toy example which highlights the issue, consider the case that X = M and for some probability 
measure on X dominated by Lebesgue measure, take Mn^k{x, ■) — a5x{-) + (1 — a)v{-). It is easy to check 
that (|3| is violated. Similarly Q is typically not satisfled in the applications of interest on non-compact 
spaces and the assumption that / is bounded is then also rather restrictive. 
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Oudjane and Rubenthaler [5S] and Heine and Crisan [12] used truncation approaches to obtain stability 
results for particle filters in expectation over the observation process, without mixing assumptions, but they 
respectively introduce a rejection step into the particle algorithm (making its computational cost random) 
and use restrictive assumptions about the state-spaces involved, the hidden Markov model (HMM) and the 
particle mutation kernels which are not realistic in the scenario of interest, van Handel [28_ proved uniform 
time-average consistency of particle filters under tightness assumptions for a class of bounded functions. Very 
recently, Del Moral et al. fT51 have studied SMC methods in which the resampling step is applied adaptively 
over time. 

The stability of SMC methods has also been studied in the asymptotic regime N —i' oo. Chopin [l] 
established a CLT for a broad class of SMC algorithms, and showed that under the same type of strong 
mixing assumptions as in ([3|-([4|, the asymptotic (in N) variance associated with the rescaled stochastic error 
can be bounded uniformly in n. Jasra and Doucet [21_ focused on the asymptotic variance corresponding to 
the algorithms proposed by Del Moral et al. [11 for unbounded functions. They obtained a bound on the 
asymptotic variance under realistic geometric ergodicity assumptions which are the same as those considered 
in the present work. They do not consider the "flattening" regime in their assumptions, and their bounds on 
the asymptotic variance are not time uniform. Further details of the relationship between the approach of 
Jasra and Doucet [21, and some of the ideas in this paper are postponed until section |3] 

It is relevant also to mention the recent results of Kleptsyna and Veretennikov [22] , Douc et al. [15] on 
forgetting of initial conditions in HMM's (i.e. without particle approximation) assuming a local Doeblin 
condition on the Markov kernels: this also does not hold in the Metropolis-on-M'' scenario of interest and 
so these results cannot be transferred directly (the results of Douc et al. [17_ do not require ergodicity of 
the kernel, but their assumptions do involve the same minorization/majorization structure). Douc et al. |16| 
proposed a generic approach to optimal filter stability without particle approximation using ideas of coupling 
inhomogeneous Markov chains and a similar approach is exploited here, further comment is delayed until 
the later sections. 

The remainder of the paper is structured in the following manner. Section |2] specifies the general form of 
the FK models in question, associated semigroups and particle systems. Section[3]deals with the deterministic 
stability of the sequences of measures arising from the FK models. Lp error bounds for the stochastic errors 
of the particle approximations are derived in Section [4] Section |5] applies the results to the case where 
X = M'* and when the Markov kernels are of the random walk Metropolis variety. The appendix contains a 
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discussion of drift properties of the particle system. 



2 Definitions 

Consider a state space X and an associated countably generated cr-algebra B{X), Let 'P{X) be the collection 
of probability measures on (X,,B(X)). For a measure /i on {X,B{X)), an integral kernel Af : X x B{X) — > 
[0,00) and a function f : X ^ R, define fif :— J.^ f(x)iJ.{dx), Mf{x) := Jy^ M {x , dy) f (y) and liM(-) = 
fy^fi{dx)P{x, •) . The function which assigns 1 to every point in X is also denoted by 1, and the indicator 
function on a set A is denoted by I^. 

Let W :X ^ [1, 00) and / : X ^ M be two measurable functions, then define the norm 

||/|k=supi|M. 

and let Cw — {/ :|| / \\w<- 00}. Let /i be a signed measure on (X, S(X)), then define the norm 

\\ nWw = sup 1/1(0)1 . 

\<j>\<W 

2.1 Feynman-Kac models and associated semigroups 

Let /i € 7^(X) and for each n e N let {M^ k; 1 < fc < n} be a collection of Markov kernels, each kernel acting 
Xx B{X) — > [0, 1]. Let {Gn,k', 0<A:<n — l}bea collection of S(X)-measurable, real- valued, strictly positive 
and bounded functions on X. The notation employed below is directly inspired by that of Del Moral [6J, with 
some important modifications, primarily to the indexing, refiecting the scenario of interest in which there is 
a different FK model for each n. 

Next, for each n e N, let {Qn,k', 1 < A: < n} be the collection of integral kernels defined by 

Qn,kix,dy) = Gn,k^i{x)Mn^k{x,dy). 

For each n and < fc < £ < n, let M^.k-i and Qn.ki the semigroups associated with the Markov kernels 
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{Afn.fc} and the kernels {Qn,k}- These semigroups are defined by 

Mn,k:i = M„,fc+lM„^fc+2 ■ • • Mn^l, k < I < 71, 

Qn,k-1 = Qn,k+lQn,k+2 ■ ■ ■ Qn/, k < £ < 71, (5) 

and Mn^k:k — Qn,k:k = Id, < k < 71. Next define the collection of probability measures {7in,k', < k < n} 
by 

, ^J■Qn,0■.kiA) 
VnM^) = —F, 7TT' ^ ^ '^W- 

Let \^n.k \ < fc < n} and \^n.k \ 1 < A: < n} be the collections of mappings, each mapping acting from V (X) 
into V (X) , defined for any rj E V (X) by 

G fx) 

'fn,kiv)idx) = I vidx), ^n.kiv) = 'i'n,k-l{v)Mn,k, 

and for < k < i < 7i denote by ^n,k:i the semigroup associated with the mappings {^n,k}, defined by 

^n,k:e = '^njO'^n,e-lO ■■■O^n.k+l, k < I < 71, 

and with the convention ^n.k-.k = Id. It is straightforward to check that under these definitions, for any 
< A: < ^ < 71, 77 e -P (X) and A e B{X), 

<^n,k:dV)iA) = TTT (6) 

■nQn,k-A^) 

and in particular, 

^n,k:e{Vn,k) = Vn,e- 

Lastly, let {Sn,k', 1 < A: < n} be the collection of Markov kernels, each kernel acting X x B{X) [0, 1], defined 
by 

(1)Ia) 

Under these definitions it is straightforward to check that, in line with (|6|, we have the alternative 
description of the mapping ^n,k:n in terms of the Markov kernels {S„^k}'- for any < k < ti and any 



7 



V € ^(X) 



^n.k:n(r])[A) = — . (7) 

V \Qn,k:n(l)) 

Consider the following assumption. 
(Al) There exists a finite constant Gx such that for all n e N and < fc < n — 1, 

Gn,k{x) < Gx, Vx e X. 
When assumption holds, for < fc < n — 1 define Gn,k : X -> (0, 1] by 

P5 / N _ Gn,k{x) 
'~rn,k[X) — = , 

Gx 

and correspondingly, 

Qn,k{^,dy) = G,i,k-i{x)Mn^k{x,dy). 

Also let Qn.k-.e denote the semigroup associated with the kernels Qn.k in the same manner as with the 
same convention Qn,k:k = Id. Furthermore under (-A[i]), let ?7„^fc : X — )■ [0, oo) be defined by Un^k{x) = 
-nlog Gn,k{x). 

As stated in section ^ the work presented here is primarily motivated by the models and algorithms 
considered in |1 1 j . However, the "backward" kernel structure which they consider is not introduced here as 
it is not essential for our purposes. A specific example is given in section [5] and at that point comment on 
how this fits with the framework of Del Moral et al. jTT] is provided. 

2.2 Particle systems 

An explicit construction of the probability space for the particle systems is not provided here, but this 
can be carried out by canonical methods, see for example jHl Chapter 3] and should be clear from the 
following symbolic description. Fix iV G N, n G N, and for < fc < n let C^^^^^ :— 1 < * < where 

each 'Ci^'*'' is valued in X. Denote rj^^, := — ^^<™;') • For 1 < i < iV and 1 < fc < n let J-^^if^ '■= 

■ • • >d^-i'd^/\ ■■•'d^fe''') and J"^^'*^ := c^(d^o'^^ d^o''')- The generations of the particle system 
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{ci^^O < fc < ?i| form a non-homogeneous Markov chain: for ji G 'P(X), the law of this chain is denoted 
by and has transitions given in integral form by: 



4=1 

{C^dx\cZl,) = ll^n,, K,-,) l<k<n, 



i=l 



where dx — d (x^, . . .x^). Denote by the expectation corresponding to P^. It is easy to check that for 
any Q <k < n andl < i < N, 



^ ^,[<k{A)], AeBiX). (8) 



3 Stability of the deterministic measures 

This section is concerned with stability properties of the sequences of Markov kernels {Sn.k} and operators 
{^n,k}- The approach is to identify non-homogeneous Foster-Lyapunov drift functions and minorization 
conditions which arise quite naturally from the structure of the FK model and then to employ the quantitative 
bounds of Douc et al. [14J. Compared to [21], in the present work the general structure of FK models is 
exploited more directly and in a way which is fruitful when the models satisfy assumption (A|4]) below. Douc 
et al. jTO] Section 5] identified drift functions and coupling sets for related operators in some specific HMM's; 
the present work is concerned with a general FK model structure. The first main idea of this section is 
illustrated by the following assumption and lemma. 

(A2) There exists A G [0, 1), a function V : X ^ [l,oo), e e (0, 1], b e (0,oo), C G 6 (X) and a probability 
measure v G ViX), such that 



inf inf M„,fe(x,A) > e-iy{A), VA e S(X), Va; € C, (9) 

n>l l<k<n 

sup sup Mn,kV{x) < XV{x) + blc{x), Vx e X. (10) 

n>l l<k<n 
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Lemma 1. Assume (J^-(J^. Then for each n eN and 1 < k < n, 

Sn,k{^r) > f-n,kVn,k{-), Va; G C, 

Sn,kVn,k{x) < XVn,k-l{x) +bn,k-l^c{x), Vx G X, 

where 



e„ := e -v 



f ^n,fc:n (1) J ; ^n,fc • — / \ ; 

^ ^ S ■ [Qn,k:n{l)) 



and Vn,k G (X) and Vn,k ■ X — )• [l,oo) are defined by 



Qn.fe:n(l)lA, 
V ( Q„,fe:„(l)J 

:= rj^^^^^ r , x G X, < fc < 7i, 



V^^n := V , with X, V, b, v, e and C as in (J^. 

Proof. Noting that < Qn,k:n{^){x) < 1 for all x G X, we have 



Sn,k{x, A) 



-^^n,fc [ Qn.k:n {m){x) ,^ . 
\^ >M«,fc Q-,fe:n(l)lA (a;) 



(Qn,fc:n(l) 



(<5«,fe:«(l)II 



>e-J^ Qn.fc:n(l) T^. X^ 



and 



< (x) + — 7^5§^^^ ^ ^ 

e ■ \ Qn.k:n(S) 

recalling the convention (5„, „:„(!) = 1. 
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The minorization and drift conditions (11|-(12| pave the way to establishing the stabihty properties of 
the operators {$„,fe}. In order to move further we introduce assumption (AjsJ) below, which is a stricter 
version of . The extra structure of (A[3| allows the construction in Proposition [l] of bi-variate drift and 
minorization conditions. Consideration of the case in which the small set arises as a sub-level set of the 
drift function is a standard and generic approach to constructing bi-variate drift conditions from their 
uni-variate counter-parts, see for example [HIT?]. Furthermore, assumption (A[3]) allows the level in question 
to be chosen in a very flexible way, and this property is exploited in Proposition [T] when dealing with the 
specific structure arising from the kernels {Sn,k}- Verification of (i^|3| in a particular application is provided 
in section [5j Assumption (A[4]) allows the bounding of non-homogeneous minorization and drift constants. 
A generic approach to verifying (A[4| is presented at the end of this section, where the connection with 
fiattening property mentioned in the introduction is made more explicit. 

(A3) There exists do > 1, X E [0,1), a function V : X ^ [l,oo), and for all d £ [do, +00), there exists 
Ed e (0, 1], bd e (0, 00) and i^d e 'P(X) such that i^d (C^) > 0, Vd{V) < +00, 

inf inf Mn^kix,A) > Ed-M^), VA e 6(X), Vx e Q, (17) 

n>l l<fc<n 

sup sup Mn.kV{x) < XV{x) + bdlcAx), yxeX, (18) 

n>l l<fe<n 

where :— {x : V{x) < d}. 

(A4) Whenever {J^ and {J^ hold, for any e V {X) with ^{V) < +00, and d e [do, +00) there exists a 
positive and finite constant K (/z, A, V, bd) such that 



inf inf fJ- {Qn.k-.nC^)) > K{n,X,V,bd), 



where A, V and do are as in (A[3|). 
The main result of this section is now presented. 

Theorem 1. Assume (jQ, (jQ and (j^. Then for a G (0, 1], th ere exists p £ (A, 1) and a finite constant 
M such that for any /i, /i' G ViX), any n e N and any 1 < k < n, 

ySn,k ■ ■ ■ S„,n - • ■ • SnAv^ < M p^-^^' [p (V;"fe-l) + , (W) 
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where Vn.k-i is as given in equation {15), and consequently, for <k <n 



||*n,fe:n(M)-*n.fe:«(M')llvo < Mp"'' 



M ( Qn,k-n (1)) M' f Q n.k:n (1) 



(20) 



The proof of Theorem [T] is postponed. It involves the bi-variate drift functions identified in the following 
proposition. 

Proposition 1. Assume and and let V, do and A be as in (J^- Then for all d > do, and all 

X G (A, 1) there exists d > d, and for each n d N, there exist 

1) a collection of functions {Ki,fe;0 < k < , with each Vn^k : X x X — > [1,cxd) and 
Vn,Ux,x') = ^[V{x) + V{x')]; 

2) collections {e„.fe; 1 < fc < n}, and {hny,Q < /c < n — l} depending on d and d, with each 
^n,k G (0, 1] and each bn,k G (0,oo); 

3) a collection of probability measures {I'n.k] 1 < P < n}, depending on d, with each 
vn,k e V{X); 

such that for I < k < n, 



S'„,fe(2;, •) A S'„,fe(a:', •) > en,kfn,k{')' ^{x,x')eC^, 



(21) 



K kVn,k{x,x') < XVn,k-iix,x') + bn,k-iicAx,x'), V(a;, x') G X X X, 



(22) 



where {Q := (a;, x') : V{x) < d, V{x') < d} , S^^j^ : X x X x B{X x X) ^ [0, 1] is defined by 



Sn,k{{x,x') ,d{y,y')) = 



Sn.ki^^ dy)Sn.k{x' , dy'), {x, x') ^ Cj, 
Rn.k{{x,x) ,d{y,y')) , {x,x')^C^ 



and Rn.k : X x X x BiX x X) — ;> [0, 1] is defined by 
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Rn.k i{x,x) ,d{y,y')) = — — {S„,k{x,dy) - e„,ki^„,k{dy)) {Sn,k{x' ,dy') - e„,fci/„,fc(dy')) • 

(1 - en,/c)^ 



Furthermore, 



inf inf e„ ^ > 0, and sup sup bn,k < +00. 

n>ll<fe<n ' n>10<fc<n-l 



(23) 



Proof. Throughout the proof, expressions featuring indices n and k hold for all n G N and I < k < n, unless 
stated otherwise. 

Fix d > do and A G (A, 1). Then let Sd, bd and Vd be the corresponding constants and niinorizing measure 
from (i^. Let K {i^d, A, V, bd) be the constant of assumption corresponding to i^d and bd, and set 



Ed {~X-X)Kiiyd,X,V, 



1 



V d. 



Then for equation (21 1, under assumption (Ap3l, there exists ej and i^d such that 



Sn,kix,A) > Ed ■ Vd \ Qn,k:n{^)lAj = Q ' Vd [Qn,kA^)) ST" 

^d ( <3n,fc:n(l) ) 



for all X E Cj. Then setting 



in,k ■- £d ■ 



^d {Qn,k:n{^) 

establishes (21 1. The first part of (23 1 is an immediate consequence of (i^Q and (j24| 



(24) 



• Vd (<9«,/c:n(l)) > £dK {Vd^ A, V, bs) . 



(25) 



Let {Ki,fc} be as defined in (15). Consider the collection of bi-variate drift functions {Ki,fc;0 < fc < n}, with 
each Vn,k : X x X — >■ [1, 00) defined by 



K.fc(a;, x') := - [Vn^ix) + Vn,k{x')] 



(26) 
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We now proceed to establish the bi-variate drift condition of equation (22 1. First, following the same 
arguments as in the proof of Lemma [T| under (-A|3| , 



Sn,kVn,kix) < XVn,k-lix) +bd- 



{Qn.k:n{^) 



Vx e X. 



(27) 



From (27 1; for ^ we have 



^n.k^n.ki^i ^ ) 

< ^ [Vn,k-i{x) + Vn,k-i{x')] + (A - A) {d+ 1) i [Ic,(a;) +Ic,(x')] 



< ^ [Vn,k-l{x) + V„,k-lix')] + (A - A) i [Vn,k-l{x) + Va,k-lix')] [lc,ix) + Ic,-(x')] 



< XV„^k-i{x,x'), 



where for the first inequality has been applied, the second inequality is due to the definition of d and 
the penultimate inequality is due to the definition of Vn,k-i- For all (a;, x') G Cj. 



^n.k^n.kix, X ) 

= Rn,kVn,ki^,x') 
1 



2(1 - e„,fc) 
A 



[Sn,kVn,k{x) + Sn,kVn,k{x') - 2e„,fet'„,fc (K,fe)] 



- ^".'^^ 2(1 - ln,k)£dyd (Qn,fc:„(l)) 



Ad 



(1 En.fe) inf^^y(^)<jAf„^fc ((5„,fc:„(l)) (a;) (1 - >in,k)£dyd (Qn,fc:«(l)) 



(28) 



where equation (27 1 has been used. This concludes the proof of equation (22). Applying (i^4| to the 



denominator terms in (28 1 and using (24l-(25l establishes the remaining part of equation (23 1 



□ 



Proof. (Theorem^. Let do and A be as in (Ajsj). Set d > do, A € (A, 1) and let d, {e„,fc}i {bn.k}i 
{Rn,k}, and{t4,^fe} be as in Proposition [l] The latter verifies conditions (NSl) and (NS2) of Douc et al. 



|14| . Consequently [14^ Theorem 8] may be applied. For a = 1, the uniform bounds in equation (23 1 of 
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Proposition [T] combined with standard manipulations of the bounds of Douc et al. |14l Theorem 8] (details 
omitted for brevity) show that there exists a finite constant M and p < 1 such that 



||M'S'n,fe ■ • ■ »S'rt,n IJ'Sn,k ■ ■ ■ Sn,n\\Y 

< Mp"^^+1 [fi {Vn,k-l) + t^' iVn,k-l)] 



(29) 



Noting that from equation (|7|, 



M (Qn,k:ni^) 



equation ( 20 ) holds due to ( 29 1 and the definition of Vn.k-i given in equation (151. For the case a e (0,1), due 



to Jensen's inequality and the fact that for any two non-negative reals a, b and a E [0, 1], (a + 6)" < a" + 6", 
we have that whenever equation (22) of Proposition [l] holds. 



Sn,k{VnM) {^^^') < {XVn,k~l{x,x') +bnM-lh{x,x'))°' 

< A"V;Vi(^>^') + &",fc-iItc(2:,x'), Wix,x')eXxX. 



and for Vn.k~i given in equation (26 I 



The arguments as for the case a = 1 are then repeated essentially replacing Vn.k, A, bn^k-i by V"i., A" and 



bn k-i respectively, in order to establish equation (19 1 and thus (20). The details are omitted for brevity. □ 



3.1 Verifying assumption (A|4]) 



The following lemma illustrates that (A|4| can be verified under a generic condition on the decay in x of the 
potential functions {G„^fc} specified via {C/„,fc}, relative to the drift function V of assumption (A[3|. 

Lemma 2. Assume and (J^- Let V, do and A be as in (J^ and assume 

Un.k{x) 

sup sup sup———— < +00. 
n>l 0<fc<n-l xeX 
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For any d > da let bd be the corresponding constant of (^5^. Then there exists a positive, finite constant C 
depending only on A and bd such that for any fi G 'P(X) with fJ.{V) < +00, 



inf inf ^ ( '9«,fc:n(l) ) > cxp [-C fi{V)] . 

n>l 0<k<n \ J 



Proof. Firstly, fi (^Qn,n:ni^)^ — and by Jensen's inequality, 



G„,„_i > exp 



>exp [-~fi{V)\\Un II yj 



For l<fc<n— 1, by Jensen's inequality. 



(Qn,fc:n(l)^ = / exp I '^Un,eixi)\ ^J.idXk) Mnx{x l-i, dxi) 

> exp / \y^Un.e{^e)]Kdxk) TT Mn/{xi_i,dxe) 

n A-^+i \tt J i=k+i 

1 1 ""^ f 

n n ^ — ' /y 

Iteration of the drift inequality in shows that for any 1 < k < £ < n, 



„ e-k-i 
V{xi)Mn,k-A^k,dxe) < X'-''V{x)+bd V X' 



It follows from ( 32 1 that 



Un,l{^t)l^^'^n,k-l-l{dxi) < \\Un,e\\v V {xt)^lMn,k~l:t{dxi) < fJ,{V) + ^ _ 



which combined with (31 1 implies the desired result. 



(30) 



(31) 



(32) 



□ 



4 hp error bounds for the particle measures 

Making use of the results of section [sj the following theorem presents an Lp bound on the error rj^nif) ^ 
Vn,n{f), for some possibly unbounded /. This theorem rests on assumptions about the moments of the 
mean particle drift, Vnki^)' ^ related normalization quantity, which are used in the proof to bound 



the moments of Martingale increments associated with the particle approximation. Discussion of the (34 1 is 
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given in the appendix and both assumptions are verified in the appHcation of section [Sj 

Theorem 2. Assume (jQ, (j^ and (jQ. Let V be as in (^^^ and for s > an independent parameter 
let t :— ii^. Let p > 1 and a G [0, 1], be such that atp < 1 and (1 + s)p < 1, and for G P(X) assume 



sup sup sup Ep 



Af>l n>l l<k<r. 



ln,k (Qri,fc:n(l 



-{l+s)p 



< +00, 



sup sup sup [t]^^^ < +00. 

Ar>lra>ll<fc<n 



(33) 
(34) 



Then there exists p <1 and a finite constant C depending on a, /i, V, and the constants in (^^jTp, (^^^, and 
(j^ such that for any f G £ya, n G N and N £N, 



E„ 



i/p 



«n -'?«.«)(/) rj < CWfWyJ- 



1 - p" \ 1 



N 



(35) 



Proof. Throughout the proof C denotes a constant whose value may change on each appearance. Consider 
the telescoping decomposition 



«n - Vn.n) (/) = ^ t*"^^- i^^) " '^n,k:n ($„,/c «fe^l))] (/) 
fc=0 



(36) 



with the convention ^n,o {Vn-i) P- For any of the terms in the summation of equation (36 1, following 
the approach of Del Moral [6, page 245] , we have 



where 



^n,k {Vn,k--lj Qn,k:n 



rin,kQn,k:n{l) $„,fc {vn,k~l) Qn,k:n{l) 
''ln,kQn,k:n — Vn,kQn,k:ni^) 



if) 



Vn,kQn.k:nil) 



Vn,kQnMn{'^) 





Qn,k:n 






Qn,fc:n(l) 



if) 



[<k-'^>n,k {vZk^l)]QZk:nif). 



k:nif)ix) ■■= Qn,k:nif)ix) - Qn,fc:n (1) (a:) 



*".fe (Vn.k-^ Qn,k:nif) 
l)(x) ^ 



^nMivZk-l Qn,k:nil) 



(37) 
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From equations ( 36 1 and ( 37 ) , for p> 1 



E 



[Mn 

n 



fc=0 



Vn.n) (/)| 
1 



1/p 



1/p 



< 



fc=0 



VZkQnM-.nW 



(l+s)p- 



l/(tp) 



, (38) 



where Minkowski's and Holder's inequalities have been applied. We next proceed to bound each of the 



factors in the summands of equation ( 38 ) 
Denoting 



1 ^ 



where 



(with the dependence of T^\. on N suppressed) we have that for any ngN, I < k < n and 1 < i < iV, 



J" 



■{N,i-lj 



n^k 



= 0, 
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with the convention that J^lfjf^ = -^lik^i ■ Next, there exists a constant C such that for any p > 1, 



E,, 



T, 



n.k 



i/p 



n.k:n {^)Sn,k+l ■ ■ ■ Sn,n{f) 
(Vn.k-l) (QnM:n{l) 



1/p 



n.k 1 Sn,fe ) 



^n,k (<fc_i) (Qn,fe:n(l) 



1/p 



+p"-^M||/||^„ E 



1/p 



?n,fc:n(l) ( t 



<p"-^M||/||,,„ E^ 



+p"-'=M||/||^„ E, 



1/p 



n,k:n (1) 



1/p 



^.P (^(^^.)) 



E„ 



< 



.k {Vn,k~l) (Qn,fe:n(l) 



(NA-l) 



1/p 



<p"-'=C||/|| 



(39) 



where Theorem [T] followed by Minkowski's inequality, Jensen's inequality, the exchangeability property of 



equation ([8|, Jensen's inequality again and the assumption of equation (34 1 have been applied. Thus for 
fixed N, {(E:=irS,-Ff/' 



l<i<iv|isa Martingale sequence with increments bounded in Lp. It 
follows that when tp > 2, by the Burkholder-Davis inequality and Minkowski's inequality, there exists a 
constant C such that 



E„ 



tp 



l/(tp) 



< CN-^E,. 



N 



i— 

< CN-^ ( ^E 



tp/2- 



i/(tp) 



\i=l 



(i) 



tp 



2/{tpy 



1/2 
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and when 1 < tp < 2, using the fact that for any a,b> and < r < 1, {a + by < + V , 



E,, 



tp 



l/{tp) 



N 



tp/2- 



l/(tp) 



T 



tp 



i/(tp) 



Combining with ( 39 ) , we conclude that there exists a constant C such that 



E„ 



Kk~^n.k {Vn,k~l)]Qn,k:ni.f) 



tp 



-{tp/2A(tp-l)}/(tp) 



for all n > 1 and 1 < k < n. 



The remaining terms in ( 38 1 are treated directly by the assumption of equation ( 33 1 , and therefore upon 



returning to ( 38 1 we conclude that there exists a constant C such that 



E, 



iVn,n-Vn,n) (/) | 



and the result holds. 



□ 



5 Application 

In this section is concerned with the case in which X = M.'^, B{R'^) is the corresponding Borel cr-algebra and 
throughout consider the following structural definitions and assumptions. 

• Let TT G 'P(X) be a target distribution admitting a density with respect to Lebesgue measure. Also 
denote by tt its density. In applications of interest, this density will only be known up to a multiplicative 
constant, Z, and denote by tt the unnormalised density, i.e. tt{x) — tt{x)/Z, x G . 

• For 7 e (0, 1] a constant, let 7 : [0, 1] — > [7, 1] be a non-decreasing, Lipschitz function. 

• LetjTT^; 7 € [7, 1]} be the family of probability measures defined by 



/^7r^(a:)da: 
/x7fT(x)dx 



, Ae6(X). 
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• Let q G 7^(X) be an increment distribution admitting a density with respect to Lebesgue measure, also 
denoted by q. For each n > 1 and 1 < fc < n, let M„_fc be a random walk Metropolis (RWM) kernel of 
invariant distribution i^^^k/n) s-^id proposal kernel q , i.e. 



-7(fe/n) 



1 A 



{x + y) 



7fT'(fe/n)(a;) 



+ 5, (A) 



1 A 



f'^('=/")(x + 2/) 



7f7(fe/")(x) 



where for any set C, C — a; := {z G X; z + a; G C} (note that in applications it may be of interest to 
allow q to depend on n and fc, for example via j{k/n), but for simplicity this issue is not pursued 
further here). 

Let {Gn.k', n > 1, < fc < n — 1} be a collection of potential functions defined by 



Gn.kix) = exp 



'llog.(.)fl«^±A)Z'^l^^ 
n \ l/n 



Consider the following assumptions on the target density tt and increment density q. 

• The density tt is strictly positive, bounded and has continuous first derivatives such that 



lim sup n{x) ■ Vlog7r(x) = —oo, lim sup n{x) ■ j < 0, 

''^°°\x\>r ''^°°\x\>r |v7r(a;)| 



(40) 



For all r > there exists > such that 



|a;| < r q{x) > tr 



(41) 



The assumptions of equations (40l-(41 1 are standard types of assumptions ensuring geometric ergodicity of 



RWM kernels [261 EU] . The assumption of equation (411 is stronger than the standard one in [501, but is 
flexible enough to verify (^[3]) which involves a family of minorization measures/constants, indexed over a 
range of levels of V . 

The interest in the specific FK models of this section arises from the choice of the initial distribution 
addressed in the following lemma. This FK model corresponds to a particular choice of the "backwards" 
kernels in ^llj . and in the corresponding SMC algorithm the order of the weighting and resampling steps is 
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reversed. 

Lemma 3. Consider the operators {'^n,k\ associated with {Gn,k\ md {Mn^k\ of section^^ Then for all 
n > 1 and <k <n, $„,o:fe {j^^} ^ ^i(k/n)- 

Proof. Fix n arbitrarily and suppose the result holds at rank < A; < n. Then 



^n,0:fe+l (tT^) (^) = *n,fe+l O *n,0:fe (tT^) (-4) 



^7((fc+l)/«)(a;) 

^7((fe+i)/")(a.) ~ 

7i"7((fe+i)/„)(^), AeS(X), 



due to the property that Mn^k is invariant for 7r^(fc/„)((ix). The proof is complete upon noting that for all 
"■I '^n.a-.a = Id, by convention. □ 

We have the following result. 

Theorem 3. Consider the collection of FK models specified in section [5[ Let s > be an independent 
parameter and set t — ■ Let a € [0, 1], and p > 1 be such that apt < 1 and (1 + s)p{l — 7)/7 < 1. Then 
there exist finite constants Ci(p, /i), and C2 ^/i,7r-y^ (depending implicitly on n and 7(-)J; o.i^d, constants 
(3 e (0, 1) and p e [0, 1), such that for any f e Cya, n > 1 and N > 1, 

where V{x) cx t:^^^{x) and for each n, :— rj^^- 

The proof of Theorem [3] is postponed until after the following proposition regarding the verification of 
assumptions. 

Proposition 2. Consider the setting of section^ Then (^7J), hold. 

Proof. As the density tt is bounded and 7(-) is Lipschitz, holds by the mean value theorem. We now 

turn to the verification of (A|3|. Various arguments are adopted from Andrieu et al. [1_ and the manipulations 
are fairly standard, but are included here for completeness. The main difference is that we need to explicitly 



22 



verify the drift and minorization conditions of (A|3| which hold over a range of sub-levels for V and the proof 
below involves verification of some assumptions taken as given in (TJ Lemma 4] . 



Firstly, due to the definition of n-y we have that Vlog7r^(a;) = 7Vlog7r(x) — 7Vlog7r(a;) and 



7f(a;)Vlog7f(a;) 7 7f^'^(x) V7r^(a;) 



V7r(x) 



|7r(a;)Vlog7r(a;)| 7 7r2^ ^(x) 



V7r^(a;) 



and so 



lim sup n{x) ■ Vlog7r-y(a;) 

'■^°°|a;|>r 



V7r^(a;) 



— oo, lim sup n{x) 



< 



(42) 



In order to verify (AjSj) we first verify a drift condition for Mo{x,dy), defined to be the RWM kernel 
reversible w.r.t. 7r^(x) with increment density q as above, i.e. 



Mo{x,A) = 



1 A 



7r^(a; + y) 



■K^X) 
1 - 



<z(2/)AL<=^dy) 



^ ^ -^-{x + y) 

7f21(.T) 



q{y)\^'^'^{dy), AeB{X). 



Let 13 e (0, 1) and define V" : X ^ [1, +oo) by 



V{x) := 



inf, TT-l^ix)' 



(43) 



The results of Jarner and Hansen [20] show that when (42 1 holds, then for Mq with increment density q 



satisfying equation (|4l|), it holds that lim^-i-oo sup|„|>„ ^ ^' < 1. Thus there exist A < 1 and px < +oo 
1 r II- V[x) 



such that 



\x\ > px^ ,w s < A. 



V{x) 



(44) 



Due to (40 1, there exists e > and Pt > such that 



> Pe => 7i(a;) • V log7r-y(x) < — e. 



Now set To = Pa V pc and do := sup|^|<j,^ ^i^)- Note that do < +oo due to the definition of V and as the 
density tt is continuous and positive. We now proceed to verify the drift part of (Al3l. 
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For any d > do let := {x : V{x) < d}. We then have 



sup AfoV^(x) <dsup^gc^— — — 



xeCa \ Ja{x) tt ■^'^{x) 



R{x) 



7r^{x) 



TT 



7(1-/9) 



(x) 



g(y)A^<'b(dy) (45) 



where A(a;) := {y G X : 7r(a; + y) > 7r(a;)}, R(x) := {y G X : 7r(a:: + y) < tt{x)}. As each of the ratios m (451 
is less than or equal to 1, then we conclude that there exists a constant Ct < +oo such that 



sup MoV{x) < dCb =: bd 



xeCd 



(46) 



Noting the definition of do, and combining (44 1 and (46) we obtain 



MoV{x) = MoV{x)I[\x\ > ro] + MoV{x)I[\x\ < ro] 

< XV{x) + MaV{x)I[V{x) < d] 

< XV{x) + bdIc,{x), 



(47) 



for any d > do and x G X. The arguments of Andrieu et al. [TJ Lemma 5] then give M„jtF(x) < MoV{x) 



and from this, (47 1 and (46), we obtain for any d> d[ 



sup sup Mn,kV{x) < XV{x) + bdlcaix), 

ri>l l<k<n 



which establishes the drift part of (i^. It remains to show the minorization part. To this end we first 
show that for any d > do, is bounded. Recalling the definition of tq, we have that for any x such that 

- ro > 0, 



V{x) 



V{n{x)ro) \TT{n{x)ro) 



7r{x) 



-7/3 



exp 



7/3(1x1 -To) / n{x) -Wlognitx + {I - t)n{x)ro)dt 



> 



exp [7/3(|a;| - ro) > 



from which we see that limr_>.oo inf|a;|>r ^(2;) = +00, which in turn implies that for all d > do there exists 
rd > such that V{x) < d ^ \x\ < r^. Then for any n > 1, < fc < n and > 0, whenever a; G 
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A-oi 



1 A 



> 



7r(x + y) 
Tr{x) 

1 A 



7r(x + y) 



£21- 



(AnB{0,ra)-x) L ^(^) 

infygi3(o,3rd) ■^jy) f 
infyes(o,3rd)7r(y) 



--: ed-Vd{A), 



where the third inequahty holds due to the properties of (7 in (41 1 and because A n -6(0, r^) n B{x, 2rd) 



An B{0,rd) whenever x £ B{0,rd). As tt is strictly positive and continuous, V is bounded on compact 
sets and therefore i^diV) < +00. Also, I) Cdg and then due to the definition of do, Vd{^d) > Vd{Cda) > 
i'd{B{0,rQ)) > 0. This concludes the verification of (A|3|. 

For (A|4|, from the definition of Gn,k, we observe that Un.k is defined by 

Un,k{x) = logTT x — - C^suplog7r(y 

V V" J V 

where C-y is the Lipschitz constant for 7(-) and we observe that sup„>]^ supq<j,<„_]^ sup^gx ^ < +00 



Assumption (-A|4]) is then satisfied upon application of Lemma [2j 



V{x) 



□ 



Proof. (Theorem^ 

Throughout the proof, we denote by C a constant whose value may change upon each appearance. 
Consider the error decomposition 



^) (/)| 



1/p 



E,, 



iVn,n-Vn,n) (/) T 



1/p 



+ \{Vn,n-7T) (/)| 



(48) 



Choose /3 such that (l + s)p(l — 7)/ (7/?) < 1 and take V to be defined as in equation (43). By proposition 



[2j the FK model of section [s] satisfies assumptions (i^, and {J^. The second term on the r.h.s. of 

( [48| is treated by application of Theorem [l] Noting that by definition, ri„,n = ^n,0:n (m) ^^id by Lemma |3] 
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= ^n,0:n (""7) obtain from Theorem jlj that there exist constants p, M and C2 such that 



where the constant C2 arises from assumption {J^^ appHed to the denominator terms and imphcitly depends 
on n-y, V, and the constants in (-A|3j). 

In order to apply Theorem [2] to the first term on the r.h.s. of (48 1 it remains to verify the assumptions of 



equations (33)-(34l. We start by addressing the latter. From the definition of V and due to the assumption 



that 7(-) is non-decreasing we observe that for all x,x' e X, 

[G„,k{x)-Gn,k{^')][V{x)~V{x')] < 

and therefore by Lemma |4] for all (possibly random) 77 G VQCj, ri{Gn,kV) < rj {Gn,k) V i^)- Then for any 
n > 1 and 1 < fc < n. 



(N,N) 



n,k — l 



< A 



Vn,k-1 i.Gn,k-l) 



+ bd„ <XVn,k-liV)+bdo, 



(49) 



where (A[3| has been applied with do is defined below equation ( 44 ) in the proof of proposition |2] Stan- 



dard iteration of the particle drift inequality ( 49 1 (details omitted for brevity) combined with the fact that 
l^no'*''' * — 1) ^I's ^I's independent and each distributed according to /i shows that 



sup sup sup [77,^^. (V)] < +00, 

Af>ln>ll<fe<Ti 



(50) 



and noting that apt < 1. equation (34 1 then holds by two applications of Jensen's inequality . 



We now turn to the verification of equation ( 33 1 . From previous considerations we notice that for some 
finite constant C, 



Ur.,k{^) = - '[^^ ] log V{X) + G 



and therefore for any 
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'n,k (<3™,fc:n(l)) 



exp I C'+ ^^2_^ ( 77:^ ) / ^OSVix3)Vn,kMn,k:jidXj 



l/n 

'/»)- 
l/n 



yiXj)Vn,kMn,k:jidXj] 



<exp(C+M^log[<,(y)] 



< 



/37 



where the penultimate inequaUty hold due to standard iteration of the drift inequality in (Apl. Therefore 



Vn,k 



(Q„,/c:n(l)) 



-{l+s)p- 



(1-7) 

due to Jensen's inequality and where q :— p~ (1 + s)p < 1 by assumption of the theorem. Equation (33 1 



then follows upon combining this with equation ( 50 I . This completes the proof. 



□ 
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6 Appendix 



When (i^[3|) or even more simply (A[2|) holds, it is natural to ask under what further conditions, if any, does 
the non-homogeneous Markov chain |ci^'*; < fc < n| also satisfy a geometric drift condition, as this would 



be one natural route to verifying (34 1. Defining Vn ■ X — > [l,oo) by 
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Vn{0 



1 ^ 



where V is as in {M2^ and C = (■f^, we see that 



< A 



Vn,k-1 (^M: 



+ b- ^ ^ 



< A 



<fc-l [Gn,k-lV 
Vn,k-1 (Gn,fe-l) 



(51) 



where 



CW := {C = {e, e : 3j e {1, iV}; e C} 



We are then faced with the issue of whether the re-weighting of 'Hnk-i t'Y the potential function destroys 
the geometric drift of Mny. for example one may ask when is it true that for some fixed 5 > 0, 



n 



Gn.k-lV 



< il + S)v^,k-i{V) 



Vn,k-1 (G„,fe-l) 



(52) 



The remainder of this section examines this question and goes on to look at some particular issues when 
X = M'^. First, we have the following Lemma, which addresses a general scenario. 

Lemma 4. For / : X — > (0, oo), 5 : X — > (0, 00), two measurable functions and S G [0, 00), 
for any r] e 'P(X) such that \t] {fg)\ < +00, \t] (f )\ < +00, 177 {g)\ < +00, if 



[f{x)-f{x')][g{x)^g{x')] 
{fi^) + fi^')]W)+9i^')] 



< 



2 + S' 



y{x,x') e x^ 



(53) 
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and only if 



[fix)- f{x')][g{x)-g{x')] ^ ^ V(rr')GX^ 
{f{x) + f{x')][9{x)+9{x')] -2 + 6-^ ^ ' ' 

Proof. For any r], f, g as specified in the statement and e e (0, 1), consider the identity: 

(i-e)n{fg)-{i + e)ri{f)n{g) 
1 



X2 



{[f{x) - fix')] [gix) - gix')] - e [f{x) + fix')] [gix) + gix')]) vidx)vidx') 
iifix) - fix')] [gix) - gix')] - e [fix) + fix')] [gix) + gix')]) r,idx) rjidx'), 



where the final inequaUty is due to Fubini's theorem, which is applicable under the hypotheses of the lemma. 
The sufficiency part then follows directly upon setting 

For the necessity part, suppose on the contrary that there exists iy,y') € such that 

[fiy)-fiy)][9iy)-9iy)] > ^^[fiy) + fiy)][9iy) + 9iy)], 



then setting rj = ^[5y + 6y'] and e = 2^ > 0, we obtain 



/ / ([/(^) - /(^')] [9ix) - gix')] - e [fix) + fix')] [gix) + gix')]) r,idx)vidx') 

= I i[fix) - fix')] [gix) - gix')] - e [fix) + fix')] [gix) + gix')]) - [6yidx) 5yidx') + 5y.idx) 5y.idx')] 

Jx^ 4 

+ / ([/W - fi^')] [ff(^) - - e [fix) + /(^')] [9ix) + gix')]) \ [5yidx) ® Sy-idx') + dy,idx) ® dyidx')] 

JX^ 4 

= -^[fiy)9iy) + fiy')9iy')] 

+\ i[fiy) - fiy')] [9iy) - 9iy')] - e [fiy) + fiy')] [giy) + giy')]) 

> \ i[fiy) - fiy')] [9iy) - yiy')] - 3e [fiy) + fiy')] [giy) + giy')]) > 0, 

which completes the proof. □ 
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Note that the sufficient condition is always met, for example, when g = o f for some positive, strictly 
decreasing and invertible function ip. The factor of 3 in the necessity part of Lemma ^arises from considering 
a suitable 77 with N = 2 support points and is illustrative in the case of interest where each of the measures 
rj^f. is atomic. The following section further explores the necessity part of Lemma jJ] in the case of X = M"*, 



which is of interest in the context of The purpose of this next section is to show that in order for (52 1 
to hold for any rj^ it is necessary that there is a very specific relationship holds between G and V . 

6.1 When X = W^ 

Let X = M'' and let /B(X) be the corresponding Borel cr-algebra. We denote by S''^^^ the unit sphere in US'* 
and for z in the image of V (resp. G), define the contour manifolds 



:= {a; e M'' : V{x) = z] and := {x G M.'^ : G{x) = z} . 

We denote by B{c,r) the closed ball of radius r and centered at c. For x,x' e M.'^, let n{x) :— |^ and let 
^x'x be the line segment with end points x and x'. Consider the following collection of assumptions. 

(A5) — G : M'' — )• (0, +00) and F : M'' [1, +00) have continuous first derivatives 

— There exist tq > and strictly positive and non-decreasing functions (j)v '■ (0, co) — > (0, co] and 
(j)a '■ (0,00) — > (0,00] such that whenever |x| > tq, 

n{x) -VlogVix) > (t)v{\x\) and n{x) ■ V \ogG{x) < ^(j)G{\x\) (54) 



The conditions of equation (54) imply that for all 5 > and whenever \x\ > tq, 



V (x + Sn(x)) , „, G(x + 6n(x)) , „, , ^ 

^ y^^^^ " > e^p[q>v{\x\)S], and ^ ^^^^ ^ " < exp . (55) 



The assumptions of ( 54 1 are largely inspired by assumptions on probability densities used to verify geometric 
ergodicity of certain Metropolis-Hastings kernels |26l EU] and are realistic in the context of [11 . It follows 
immediately from the arguments of Roberts and Tweedie [55', proof of theorem 2.1] transferred to G and V, 
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that for large enough, the contour manifolds of G and V which contain x are parameterizable by the unit 
sphere in the sense that: 

• For each z > sup^y^^^^ ^iu) there exists a bijection between S'^^^ and Vz, and for each z' < inf|j,|<r(, G{y) 
there exists a bijection between 3"^^^ and Qz', such that, 



Vz = {wziOC ■■ C G S"-'} , and Gz' - {hziCK ■■ C € ^'"'} , 



(56) 



where Wz(-) and hz{-) are positive and continuous functions on S''' ^. Furthermore, Vz H i3(0,ro) = 
G,.nB(0,ro) =0. 

In order to describe the relationship between contour manifolds of V and G which intersect at some point, 
we introduce the function t/j : M"^ — > [0, oo] defined by 



ip{x) := sup {\hG{x){C)\-\wv{x){C)\) 



(57) 



which implicitly depends on G and V. We have the following proposition. 
Proposition 3. Assume G : X ^ (0, oo) andV -.X-^ [1, oo) satisfy (jQ and (j^ with 



lim 



inf 0v(s) 

s>r 



inf 0g(s) 

s>r 



inf ip{x) — +00. 

|a;|>r 



(58) 



Then for any < (5 < 1, f/iere exists an atomic rj G 7^(X) sitc/i f/iai 



77(G) 



> {l + S)viV). 



Proof. In outline, the proof involves showing that if the condition in (58) holds, then for all e G [0, 1), there 
exists (y,y') G such that 

f Giy)~G{y') \ ( V{y)-V{yr 



\G{y) + G{y')J \Viy) + V{y') 



and then employing Lemma ^ 



Firstly, suppose (58 1 fails to hold. Then fix arbitrarily e G [0, 1) and let r > tq be large enough that 



inf cj)v{s) 

s>r 



inf 0g(s) 

s>r 



inf ip{x) > 2 log 

|a:|>r V 1 
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and then let y be any point such that \y\ > r, V{y) > sup|„|<^y(u) and ipiu) > (such r and y exist 



due to the hypotheses of equation (54) and the assumed failure of (|58|). Then recalling the definition of 
tp in (57l, and as, for fixed y, hQ(^y-^{ ) and are continuous functions, there exists C G S'^^^ such 

that |/iG(y)(C)| ^ |^v(i/)(C)| = ^{y) > 0. With a slight abuse, let x :— /iG(y)(C)C; 2;' := wv{y){OCi £^nd let 
y' := -^{x + x'). It follows that the line segment l^'x lies on a ray and |x| — \y'\ ~ \y'\ — \x'\ ~ ip{y)/2 > 0. 



Observe that under the implications of (54 1 stated before the proposition, by construction V{x) > V{y') > 
V{x') = V{y) and G{x') > G{y') > G{x) ~ G{y). It must also be the case that |a;'| > r, as otherwise 
^(y) = ^{^') ^ sup|„|<^ contradicting the definition of y. The situation is illustrated in Figure jlj 



Due to the hypothesis of equation (55 1 and the definition of r, we then have 



V[y') V{y') 



< exp 



V(lx'l)^ 



< exp 
,1 

< 



inf (t>v{s) inf ipiu) 

s>r |M|>r 



1 + 



(59) 



and similarly. 



Gjy) ^ G{x) 
G{y') G{y') 



< exp 



^c(ly'l)^ 



< exp 
,1 - 

< 



inf (pais) inf '>p(u) 

s>r \u\>r 



1 



(60) 



It follows from (59l-(60l that 



1 - 



vjy) 

V{y') 



Vjy) 

V{y') 



> Ve and 



Gjy) 
G(y') 



Gjy) 
G(y') 



The proof is complete upon noting that e was chosen arbitrarily in [0, 1) and applying the necessity part of 
Lemma ID □ 

We may then formulate the following example. 



Corollary 1. Let X = M^. For any e > , let G and V be defined by 
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Figure 1: Solid line: Gaiy)- Dashed line: Vv(y)- Radial distance of ip{y) outwards from Gaiy) is indicated. 



V{x) = exp ( (xi — e 



and 



G{x) = exp ^- {xi + ef + x\^ 



where x = {xiX2)- Then for any < (5 < 1, there exists an atomic rj G 'P(X) such that 



77(G) 



> (1 + S)7]{V). 



(61) 



Proof. Elementary manipulations show that (i^ holds with tq = 2e and taking 0g(|x|) = (|a;|) = 2(|a;|— e) 



for \x\ > tq. For any r > tq, consider the contour manifolds Vz and Qz' for z — V(0,y/r 



and 



z' = G (0, -y/r^ — e2j^ jg straightforward to check that -0 ((O, ^r'^ — e^)) = 2e and the result follows from 
Proposition [3j □ 

This example serves to highlight that a very specific relationship between the potential functions and the 



drift function is required if inequalities of the form (52) are to hold for (1 + (5)A < 1 and for all probability 



measures. The application of section [5] is one situation where such a relationship holds. We note that it 
is possible to bound 77^ {V) without confirming (52 1, by appealing to convexity and the flattening 



property of the potential functions combined with the geometric drift; but the application of section |5] will 
not need such an approach and so we do not report these details here. On the other hand, if inequalities 



of the form (52 I do hold with (1 + 6)X < 1, one might then pursue accompanying minorization conditions 
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for the chain |Q fe ; < < n|, but it seems difficult to achieve this in such a way that the minorizing 
constants do not degrade as N increases. 
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